CDS

Accession Number TCMCG011C02578
gbkey CDS
Protein Id XP_021912147.1
Location join(3290..3427,3725..3862,4010..4059,4938..5046,5144..5259,5356..5473,5931..6047,6169..6260,6333..6399,6483..6564,6650..6741,6960..7103,7273..7419)
Gene LOC110825913
GeneID 110825913
Organism Carica papaya
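
The Location field above uses the GenBank-style join(...) notation, where each start..end pair is one coding segment. As a consistency check, the segment lengths can be summed and compared with the protein length reported in this record. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, as in the GenBank feature table convention, and that the CDS includes the stop codon. The names `LOCATION`, `PROTEIN_LENGTH`, and `segment_lengths` are hypothetical and introduced only for illustration.

```python
import re

# Location string copied from the record's "Location" field.
LOCATION = (
    "join(3290..3427,3725..3862,4010..4059,4938..5046,5144..5259,"
    "5356..5473,5931..6047,6169..6260,6333..6399,6483..6564,"
    "6650..6741,6960..7103,7273..7419)"
)

# Protein length copied from the record's "Length" field (469aa).
PROTEIN_LENGTH = 469


def segment_lengths(location: str) -> list[int]:
    """Return the length of each start..end segment in a join(...) string.

    Assumption: coordinates are 1-based and inclusive, so a segment
    start..end spans end - start + 1 nucleotides.
    """
    pairs = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in pairs]


lengths = segment_lengths(LOCATION)
cds_length = sum(lengths)

# Assumption: the CDS carries one stop codon beyond the coding codons,
# so its length should be 3 * (protein length + 1).
expected_length = 3 * (PROTEIN_LENGTH + 1)

print(f"segments: {len(lengths)}, CDS length: {cds_length} nt")
print(f"matches protein length: {cds_length == expected_length}")
```

For this record the check yields 13 segments totalling 1410 nt, which equals 3 × (469 + 1), so the Location field agrees with the reported protein length.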

Protein

Length 469 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264084
db_source XM_022056455.1
Definition heparan-alpha-glucosaminide N-acetyltransferase [Carica papaya]

EGGNOG-MAPPER Annotation

COG_category S
Description Protein of unknown function (DUF1624)
KEGG_TC -
KEGG_Module M00078
KEGG_Reaction R07815
KEGG_rclass -
BRITE ko00000
ko00001
ko00002
ko01000
KEGG_ko ko:K10532
EC 2.3.1.78
KEGG_Pathway ko00531
ko01100
ko04142
map00531
map01100
map04142
GOs -

Sequence

CDS:  
ATGACTGAGATAAAACCCGAATCACCGAAGGAGCAACGCTTGATTATTGCCCACCAACCCGATATCTCCGACCCCGACAAGCGTCTCAAACCCAAGCGCGTCGCTTCCCTTGACATCTTCAGAGGCCTCACTGTTGCTTTGATGATTTTGGTTGATGATGCTGGAGGGGAGTGGCCTGTGATTGGGCATGCGCCATGGAATGGCTGCAACCTTGCTGATTTTGTCATGCCCTTCTTCTTGTTCATTGTTGGCATGGCCATTGCCCTTGCCTTCAAGAGAATTTCAAGCCTGAGACTAGCTCTTGAAAAGGTGATTCTTAGACTCTGTTTTTTACTCTTTGTATTTTTGTTTCATGAAGGAGGTTTCTCACATGCACCTGACAAGTTAACTTATGGTGTTGATATGAAAATGATAAGGTGGTGTGGCATTCTTCAGAGAATAGCACTTACTTATTTGGTAGTGGCACTCATGGAAATCCTTATGAGAGATGCAGTGGCAAAGAATCTTTCATTTGGCAGGTTATCTATATTCAGGTTGTACTACTGGCATTGGCTAGTGGCTGCTTGTGTACTAGTTGTTTACTTTTGTGTTCTTTATAGTGCCTATGTACCGGATTGGCAATTCACTGTTCATGATGAGGATAGTTCCGATTATGGGAAGATTTTCACCGTATCATGTGGTGTGAGGGGAAAACTTAATCCTCCTTGCAATGCTGTTGGATTTGTTGACAGAGCAGTATTGGGAATCAATCATATGTATAAACATCCTGCATGGAGGAGATCTAAGGAATGCACTCAGAATTCCCCGTATTCAGGACCTTTCCGAAATAATGCTCCATCATGGTGCTATGCACCTTTTGAACCTGAAGGAATTTTAAGCTCCATATCTTCTGTTCTTTCTACAATTATCGGAGTGCATTTTGGAAATGTGCTTATACATTTCAAGGAGCATTTAGCTAGACTGAAGCATTGGATGATAATGGGAACTTCTCTCCTTGTTTTAGGACTTGCTTTACATTTTACTCATGCCATTCCTTTCAACAAACAATTATACACTTTTAGCTATGTTTGTGTAACTTCTGGAGCGGCAGCACTGGTTTTTTCTGCCATCTACATACTGGTTGATATTTTGGATTTCAAGTATATGTTTCTGCCATTTGAGTGGATTGGCATGAACGCTATGCTTGTTTACGTTATGGCAGCAGAAGGAATTTTTGCTGGCTTCATTAACGGATGGTTTTACGATGATCCACACAATACACTGATACACTGGATTCAGAAGCACGTATTCGTTGGAGTCTGGCACTCGCAAAGAGTAGGCATTCTGCTTTATGTCATCTTCGCGGAGATCTTCTTCTGGGGCATCGTTGCAGGCATTTTCCACAGGATGAATATCTATTGGAAGCTTTAG
Protein:  
MTEIKPESPKEQRLIIAHQPDISDPDKRLKPKRVASLDIFRGLTVALMILVDDAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIALAFKRISSLRLALEKVILRLCFLLFVFLFHEGGFSHAPDKLTYGVDMKMIRWCGILQRIALTYLVVALMEILMRDAVAKNLSFGRLSIFRLYYWHWLVAACVLVVYFCVLYSAYVPDWQFTVHDEDSSDYGKIFTVSCGVRGKLNPPCNAVGFVDRAVLGINHMYKHPAWRRSKECTQNSPYSGPFRNNAPSWCYAPFEPEGILSSISSVLSTIIGVHFGNVLIHFKEHLARLKHWMIMGTSLLVLGLALHFTHAIPFNKQLYTFSYVCVTSGAAALVFSAIYILVDILDFKYMFLPFEWIGMNAMLVYVMAAEGIFAGFINGWFYDDPHNTLIHWIQKHVFVGVWHSQRVGILLYVIFAEIFFWGIVAGIFHRMNIYWKL